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ABSTRACT 

In this paper, we introduce PICACS, a physically-motivated, internally consistent 
model of scaling relations between galaxy cluster masses and their observable proper- 
ties. This model can be used to constrain simultaneously the form, scatter (including 
its covariance) and evolution of the scaling relations, as well as the masses of the indi- 
vidual clusters. In this framework, scaling relations between observables (such as that 
between X-ray luminosity and temperature) are modelled explicitly in terms of the 
fundamental mass-observable scaling relations, and so are fully constrained without 
being fit directly. We apply the PICACS model to several observational datasets, and 
show that it performs as well as traditional regression methods for simply measuring 
scaling relation parameters, but presents several significant advantages. For clusters 
with available X-ray hydrostatic masses, PICACS gives a modest improvement of the 
precision of the mass estimates, while consistently constraining the mass-observable 
scaling relations. For a sample of clusters without prior mass estimates, we derive self- 
consistent constraints on the cluster masses and scaling relations, and find a minor 
improvement in precision on cluster mass estimates compared with a single scaling re- 
lation. We are also able to deconstruct the slope of the luminosity-temperature {LT) 
relation and show that the steepening compared to self-similar expectations is due 
to contributions from heating and depletion of the gas within the reference radius 
i?500, and not due to a mass dependence of the gas structure within that radius. 
Finally, we use PICACS to illustrate the dependence of the expected self-similar evo- 
lution of the LT relation on the slopes of the mass scaling relations, and show that 
our self-consistent modelling predicts self-similar evolution significantly weaker than 
is commonly assumed. 

Key words: cosmology: observations - galaxies: clusters: general - galaxies: high- 
redshift - intergalactic medium - X-rays: galaxies 



> 1 INTRODUCTION 



X 



Simple theoretical arguments lead to an expectation of 
power-law scaling relations between the masses of galaxy 
clusters and their observable properties ( Kaiser] 1986 Bryan 
fc Norman|1998 1 . These scaling relations have been the sub- 



ject of a great deal of attention, in particular those involv- 
ing X-ray observations of the properties of the intra-cluster 
medium (ICM) (e.g. Finoguenov, Reiprich & Bohringer 



2001 1 'Reipri ch fc Bohringer] |2002[ [Sanderso n et al.||2003[ 



Vikhlinin et al.||2003[ ). These X-ray scaling relations are of 



interest, as correctly modelling the forms of the scaling rela- 
tions tests our understanding of the physical processes that 
heat and shape the ICM over cluster lifetimes. Furthermore, 
if the forms of the scaling relations are known to some pre- 
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cision, then they provide an efficient tool to estimate cluster 
masses in the absence of detailed data to allow, for instance, 
an X-ray hydrostatic mass analysis. 

When mass estimates are not available for clusters, it is 
also common to study the correlations between X-ray prop- 
erties as a way to gain insight into ICM physics. For this 
reason, the X-ray luminosity-temperature {LT) relation has 
been extensively studied (e.g. Mitchell et al.||1979 Edge & 



Stewart 1 99"T||Markevitch|1998| [Pratt et al.|2009||Maughan 
et al. 20121. It is widely found that the slope of the LT 



relation is steeper than that expected if gravitational heat- 
ing of the ICM were the only significant influence (but see 
Maughan et al. 2012 for a notable exception). Successful 
models of additional ICM physics are then expected to ex- 
plain the steepening of the LT relation. In this paper, we will 
argue that fitting models of the LT relation to observations 
of clusters and investigating departures from self-similarity 
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is not optimal. A much better approach is to jointly model 
the scaling relations between cluster observables and their 
masses, and use those to dictate the form of the LT relation. 

The usual approach in these endeavours is to model each 
of the scaling relations independently using a form of lin- 
ear regression. Perhaps the most popular form is the BCES 
method, which accounts for errors in the x and y variables. 



and intrinsic scatter in the population being modelled ( Akri 



tas & Bershady 19961. More recently, Bayesian techniques 



have been employed to allow greater flexibility in modelling 



(Kelly 2007 Andreon & Hurn 20101, but these are usu 



ally only employed to look at individual scaling relations. 
Bayesian techniques are now commonly used in cosmologi- 
cal studies, with many of these jointly modelling one or more 
mass-observable scaling relation (e.g. Rozo et al.|2009 Ben- 
son et al.|2011 1. The most advanced treatment of the X-ray 
scaling relations thus far is the self-consistent modelling of 
the scaling relations and mass function for a large sample 



of clusters by Mantz et al. (2010b|a|, which included joint 



modelling of the mass scaling of L and T. 

There is far more information in the cluster datasets 
than is typically used in studies of the scaling relations of 
cluster populations. In this paper, we present a method 
of Physically-motivated, Internally Consistent Analysis of 
Cluster Scaling (PICACS) that jointly constrains the form 
of the scaling relations between different observables and 
cluster masses. This maximises the use of the observational 
data, provides new information on the extent to which differ- 
ent observable properties depart from self-similar behaviour, 
and gives improved mass estimates for individual clusters. 

The paper is laid out as follows. In !j2]we derive the set 
of scaling relations used to model the cluster population, 
and then present the statistical framework used to imple- 
ment the model in i|3] We then apply the new technique to 
observed samples of galaxy clusters with individual hydro- 
static mass measurements (Q and without mass estimates 
(ij5|. We examine the implications of our results for estimat- 
ing clusters masses in !|6] and for dissecting the slope and 
evolution of the LT relation in ^ Throughout the paper, 
we assume a ACDM cosmology with fiji/ = 0.3, = 0.7 
and Ho = 70 km s"^ Mpc"^ . 



2 SELF-CONSISTENT SCALING RELATIONS 

The galaxy cluster X-ray scaling relations were proposed by 
|Kaiser| ( |1986| l, based on simple arguments of self-similarity 
for clusters dominated by gravity, and their derivations have 
been extensively covered in the literature (e.g. 'Bryan & Nor- 
|man||1998| [Maughan et al.||2006| . Here we briefly review 



the standard derivations of self-similar scaling relations, and 
then extend them to build a self-consistent set of relations 
to describe non-self-similar clusters. 

The three main properties of the ICM that are observ- 
able in X-rays, and are expected to scale with cluster mass, 
are the temperature (T), mass (/iQ and luminosity (L) of 
the ICM. We will consider how each of these observables 
are expected to scale with the total mass (M) of a cluster. 



^ We adopt the unusual notation of fi for gas mass rather than 
e.g. A/gas in order to avoid an abundance of subscripts. 



In order to account for the mass dependence of cluster size, 
and the evolving background density field from which clus- 
ters collapse, it is convenient to consider properties within 
an overdensity radius 7?a, which encloses a mean density 
of Apc(z). The use of the critical density at the redshift 
of the cluster as a reference density introduces an expected 
evolution into the resulting scaling relations, parameterised 
through 



E{z) = x/nAf(l + zf -Kl - - A)(l z)2 + A (1) 

and which describes the redshift dependence of the Hubble 
parameter. In the following derivations, all properties are 
implicitly measured within the same radius R/\- 

Assuming that the ICM is in virial equilibrium with the 
cluster gravitational potential, the virial theorem gives 



T (xE{zY'-'M 



2/3 ,_^2/3 



(2) 



We will parameterise this function to allow the slope and 
evolution to differ from the self-similar expectations of the 
virial theorem 

Bt 



T 

To 



= AtE(z) 



M 
Mo 



(3) 



Here At is a constant of proportionality and To and Mq 
are normalisation constants introduced for numerical conve- 
nience later. Throughout this work, we use To = 5 keV and 
Mo = 5 X lO^M© 

For self-similar clusters, the mass of gas (^) in the ICM 
is a constant fraction /gas of the total mass: 

/i = /gasM 



(4) 



This can be rewritten, with the addition of a slope parameter 
to allow a mass dependency, as 



MO 



M 
Mo 



(5) 



and we set the normalisation parameter /io = 5 x IO^^Mq 
throughout this work. As an aside, we note that equation 
^ is equivalent to writing the mass dependency of /gas as 

Su-i 



MO 
Mo 



Au 



M 
Mo 



(6) 



The luminosity of the ICM is dominated by 
bremsstrahlung emission for T >2 keV, where line emission 
is not significant. If we also neglect the weak temperature 
dependence of the Gaunt factor, then the luminosity is given 
by 



L oc 



nl/2 



dV 



(7) 



The integral of the density depends on the distribution of 
the ICM. We follow [Stanek et aL] ( |2010[ ) by factoring the 
density into a mean density and a dimensionless structural 
parameter Q, such that equation ([7| becomes 



L cc E{zfQfl^,T^'''M 



(8) 



In other words, the luminosity of the ICM depends both on 
the amount of gas in the cluster (via /gas), and how that 
gas is distributed (via Q). For self-similar clusters, Q and 
/gas are independent of mass and can be absorbed into the 
proportionality constant. 

It is widespread practice (e.g. Maughan et al.|[2006 l to 
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derive the self-similar relation between luminosity and mass 
{LM relation) by setting Q and /gas to constants and using 
equation ([2| to eliminate T from equation Q. However, this 
does not maximise the observational information, and if self- 
similar behaviour breaks down, it becomes unclear which of 
the mass scalings are being broken. 

Instead, we rewrite equation ([8| to allow for a power- 
law mass dependence of the ICM structure parameter Q 
(moving any constant component into the proportionality 
constant), giving 



(9) 

Although we interpret this slope parameter Bl SlS predom- 
inantly describing mass-dependence of Q, it could also de- 
scribe the effects of the increasing contribution of line emis- 
sion to the luminosity at lower temperatures (< 2 keV), 
which modifies the temperature dependence towards T~^^'^ 
in equation Q. 

Now, rather than use equation ([6| to substitute for /gas, 
let us instead explicitly keep the observed quantities, and 
write our LM relation as 



L 



AlE{z) 



(10) 



Lo ''^ VmoV \To ) 

where 7_t = 2 and Bl = 1 for self-similar clusters, and we 
set I/O = 5 X 10^* erg s"^ . For the purpose of visualising the 
model fits with a simple plot of L against M, it is convenient 
to write the LM relation as 

S,,-2+2B^+Br/2 



= AlAIa}/^E{z 



Lo 



~\7j:,+7t/2 



M 
Mo 



(11) 

Equations (|3|, ([5| and 1 10 1 form a physically-motivated, 
internally consistent description of the scaling relations be- 
tween the key X-ray observables and cluster mass. These 
scaling relations are well-established, but the explicit de- 
pendence of L on T and in equation is not usually 
preserved. We refer to these collectively as the PICACS scal- 
ing relations. 

Finally, the relation between luminosity and tempera- 
ture (LT relation) has long been used as a key observational 
diagnostic of non-gravitational processes in clusters, with de- 
partures from the self-similar form of the LT relation used 
to measure the nature and extent of those processes. How- 
ever, as for the LM relation, the self-similar form of the LT 
relation is usually derived by substituting equation Q into 
equation Q to eliminate M. This results in an LT relation 
of the form 

^^A..E(.)-(-) (12) 

where departures from 7lt = 1 and Blt ~ 2 are taken as 
evidence for similarity breaking. However, this is only true 
if all of the scaling relations between the cluster observables 
and mass are self similar. This is made clear if we write 
the parameters of the LT relation in terms of the PICACS 
scaling relations: 



Alt 
Blt 



AlAIa'^'-'^^' 
2B. + Bl-2 



^LT = 7L 



Bt 

7t 

Bt 



1 

2 



2B^ + Bl-2) 



(13) 
(14) 
(15) 



Thus the slope of the LT relation departs from self-similarity 
if any or all of the slopes of the PICACS scaling relations 
differ from their self-similar values, but measuring the slope 
of the LT relation will not tell us which. Similarly, the self- 
similar evolution of the LT relation differs from 7_tT = 1 
if the evolution or slopes of the PICACS scaling relations 
differ from their self-similar values. In other words, a simple 
measurement of the evolution of the LT relation could imply 
real evolution, when in fact the PICACS scaling relations 
evolved self-similarly, but the slope of one or more were not 
self-similar. 

We thus argue, that the LT relation is a tool that is at 
best blunt, but possibly also inaccurate, for the study of the 
scaling relations - instead we propose the use of PICACS. 
In the following section we present a statistical framework 
that enables the modelling of the cluster population with 
the PICACS scaling relations. 



3 STATISTICAL FRAMEWORK 

Perhaps the most obvious way to measure the PICACS scal- 
ing relations would be to fit each relation independently to 
a sample for which we have observations of T, fi, L, and an 
observationally determined M (e.g. from X-ray hydrostatic 
masses). However, by using Bayesian techniques, it is possi- 
ble to construct a statistical framework to jointly determine 
the probability distributions of the PICACS parameters and 
cluster masses, given the observational data. The following 
treatment was inspired by the Bayesian analysis of cluster 



mass-richness relations in Andreon & Hurn (20101. 

Generically, Bayes' theorem can be used express the 
probability of some model parameters 6i given observational 



p{e,\Dj) oc p{D,\e,)P{e, 



(16) 



The probability on the left hand side is referred to as the 
posterior, while the first term on the right describes the like- 
lihood (£) of the data given the model. The last term de- 
scribes the prior probabilities of the model parameters. 

We can construct the likelihood of the PICACS scaling 
relations and cluster masses in terms of the observables. The 
PICACS scaling relations predict the value of each observ- 
able given a cluster mass, but observed values are expected 
to differ from the model predictions due to the intrinsic scat- 
ter St, S^, Sl of the population about each relation, and the 
statistical scatter described by the measurement errors ctt, 
o'^, (Jl on each observed quantity. In the following, we will 
use the subscripts oba to indicate an observed quantity, mod 
to indicate a quantity predicted by a PICACS scaling re- 
lation, and int to indicate the model prediction including 
intrinsic scatter. 

For example, for a cluster of mass M, we might have an 
observed mass Mobs with error om, and a predicted temper- 
ature Tmod from equation ([3|. The intrinsic scatter in the 
relation St will then randomly shift the temperature to a 
value Tint, which we then observe as Tabs with error ot- 

The likelihood of our observation of Mobs for a cluster 
of mass M is simply given by 



Cm = PiMobs\M,aM) 



(17) 



The likelihood of our observation of Tobs for the same 
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cluster is the product of the probabilities of the cluster being 
scattered to temperature Tint and then observed at temper- 
ature Tobs: 



Ct = -P(Tobs!rint,0-T)-P(rmtiT'mod,'5T) 
= P (Tob. i Tint , (TT ) P (Tint I M, St , 5t ) 



(18) 



where Tmod is the temperature predicted by the TM scaling 
relation in Equation Q , so is a function of M and the scaling 
relation parameters 6t = (tIt, -Bt, 7t)- 

The likelihood of the observation of the gas mass ^ for 
the same cluster is similarly 

= P(/iobs!Mint, 0-fi)P(/iint|Mmod, 5^) 

= P(Mobs|Mmt,Crp)P(/Xint|M,6l^,(5p) (19) 

Again, /imod is given by equation ([5|, and is a function of 
M and the scaling relation parameters 6*^ = (yl^, P^j, 7^). 

To compute the likelihood of our observation of L, we 
must consider the contributions of the intrinsic scatter in 
/i and T to that in L. Thus, rather than use -tvint as an 
independent parameter in our model, we define 

1/2 / ,,r \ -Bl-2 



Lint 



Mint 
MO 



Tint 
To 



M 

Mo 



(20) 

Here we have introduced Qint as an additional intrinsic scat- 
ter term to describe the residual intrinsic scatter in luminos- 
ity at a given mass, once the contributions from the scatter 
in n and T are removed. Qint is distributed as a Gaussian 
in with mean of unity and standard deviation of 5q , so adds 
scatter to L without a systematic contribution. This explic- 
itly separates out the scatter in L due to variations in the 
amount of gas (Sfj,), and the gas temperature (St), from 
that due to other factors (Sq) which are likely dominated 
by structural variations in the ICM, but could also include 
contributions from e.g. variations in the ICM metalicity. 
The likelihood function for L is then 

Cl = P(AobsiAint,0-A)P(LintiA/,6»L,e»T,6l^,5L,5T,5M) (21) 

where Ol ~ {Al, Bl,"/l) are the luminosity scaling relation 
parameters. 

The treatment above assumes that the intrinsic scatter 
and statistical scatter on the observables are all indepen- 
dent. This is likely to be incorrect at some level, as the 
processes driving the intrinsic scatter (e.g. mergers, cooling, 
AGN feedback) will impact all of the ICM observables to 
a greater or lesser extent. The correlation of intrinsic scat- 
ter has not been widely studied, but simulations suggest a 
coherent motion of clusters along the LT relation during 



mergers (e.g. Rowley, Thomas & Kay 2004 Hartley et al, 
20081, implying correlated scatter in the TM and LM rela- 



tions. Observationally, Mantz et al. (2010a I found the cor 



relation of intrinsic scatter in L and T to be consistent with 
zero, albeit without strong constraints. The possibility of 
correlated intrinsic scatter is incorporated into the PIGACS 
model by using a covariance matrix Ct^l to describe the in- 
trinsic scatter. The diagonal terms are 5^,5^,(5^, while the 
off-diagonal terms give the covariances between Pint, Mint and 

Qint • 

The joint likelihood of the intrinsically scattered values 



where the probability distribution is a multivariate Gaussian 
distribution with mean values given by (rmod,Mmod, 1) (i.e. 
the mean Qint is 1) and covariance given by Ct^l. With this 
change, the full covariance matrix becomes a parameter of 
the model. 

Describing the statistical scatter with a single covari- 
ance matrix is not possible, as it will be different for each 
cluster due to the differing statistical errors. Instead, we 
model the covariance of the statistical scatter in terms of 
the correlation coefficients between the observed quantities, 
TTii, ttl, TfiL- The joint likelihood of the observed values of 
each property is then given by 

^obs — P(Tobs , Mobs , Pobs I Pint , Mint ! Lint,DT^L) (23) 

where the probability distribution is a multivariate 
Gaussian with mean Tint , Mint , iint and covariance given 
by Dt^l, which is defined for each cluster with di- 
agonal elements a^,a^,aj^, and off-diagonal elements 
rT^i/ir^TU^i), rTL/{ffT(TL),r^L/{(TfxaL)- Here the a terms are 
different for each cluster (the measurement errors), but the 
correlation coefficients r are in common. The three correla- 
tion coefficients are thus the model parameters describing 
the covariance of statistical scatter. 

For a set of observations of multiple clusters (denoted by 
the index i), we take the product of each likelihood evaluated 
over all clusters. The joint likelihood of our observations is 
then given by 



(24) 



J~-i„t = P(Tint, Mint, Lint\M, 6l, 6T,9fj,, Ct/il) 



(22) 



The posterior probability distribution of our model pa- 
rameters is 

P(6T,0fj,,9L, M, Tint , fJ-int , Qint \ -^'-^obsi Tobs; fJ'ohs-! -^'obs) 

oc C P{eT)P{e^)Pi9L)P{M)P{CT^L)P{rT,.,rTL,r^L) 

(25) 

Note that the cluster mass Af is a parameter of our 
model, as are the intrinsically scattered quantities. The lat- 
ter are simply nuisance parameters that are marginalised 
over, but the appearance of M as a parameter means that 
the PIGACS framework can be used to constrain M given a 
combination of A/obs and/or priors on the form of some or 
all of the scaling relations. A key advantage of this method 
is that all of the observables are being fit against the same 
cluster mass in an internally consistent manner. 

As we will demonstrate later, it is entirely possible to fit 
the PICAGS relations to the observed quantities without an 
observed mass for the clusters being considered. In this case, 
we remove Cm from our likelihood function, but keep M as 
a parameter of the model. A'l can then be constrained by 
the PICAGS scaling relations, but care is needed as strong 
degeneracies arise between the scaling relation parameters 
and between the intrinsic scatter terms. To avoid this, in- 
formative priors are required on a subset of the parameters, 
as discussed in §5.1| 

PIGACS has shares some features and much of its phi- 
losophy with the [Mantz et al.| ( |2010a[ ) approach, but is not 
designed with cosmological analyses in mind, and so cur- 
rently lacks the ability to model selection biases, which is 
a major strength of the [Mantz et al.] ( [2010a[ ) work. On the 
other hand, PIGACS has a more detailed model of the inter- 
dependency of the scaling relations (specifically the inclusion 
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Name 




z 


T 






A/flt 


Reference 








keV 


1O"M0 




IQI^Mq 




A133 





0569 


4.02 ±0.07 


2.63 ±0.32 


3.17 ± 0.38 


2.93 ±0.16 


V06 


A383 





1883 


4.67 ±0.12 


3.79 ±0.38 


3.06 ±0.31 


3.46 ±0.16 


V06 


A478 





0881 


7.70 ±0.12 


9.22 ± 1.21 


7.68 ± 1.01 


7.99 ±0.58 


V06 


A1413 





1429 


7.16 ±0.11 


8.10± 0.81 


7.57 ± 0.76 


7.24 ± 0.43 


V06 


A1795 





0622 


5.94 ± 0.05 


6.27 ± 0.54 


6.03 ±0.52 


5.71 ± 0.31 


V06 


A1991 





0592 


2.53 ±0.06 


1.25 ±0.17 


1.23 ±0.17 


1.36 ±0.10 


V06 


A2029 





0779 


8.22 ±0.09 


9.85 ±0.91 


8.01 ± 0.74 


8.70 ± 0.44 


V06 


A2390 





2302 


8.62 ±0.17 


15.14 ± 1.52 


10.74 ± 1.08 


10.40 ±0.78 


V06 


MKW4 





0199 


1.59 ±0.04 


0.48 ± 0.06 


0.77 ±0.10 


0.66 ±0.05 


V06 


A1983 





0442 


2.08 ± 0.09 


0.64 ± 0.09 


1.09 ±0.37 


0.90 ± 0.10 


A07 


MKW9 





0382 


2.32 ±0.23 


0.49 ±0.05 


0.88 ±0.20 


0.76 ±0.08 


A07 


A2717 





0498 


2.44 ± 0.06 


1.02 ± 0.05 


1.10 ± 0.12 


1.18 ± 0.09 


AO 7 


A2597 





0852 


3.50 ±0.09 


2.51 ± 0.09 


2.22 ±0.22 


2.41 ± 0.15 


A07 


A1068 





1375 


4.46 ±0.11 


3.77 ± 0.10 


3.87 ± 0.28 


3.49 ±0.18 


A07 


PKS0745-191 





1028 


7.61 ± 0.27 


10.71 ± 0.48 


7.27 ± 0.75 


7.88 ±0.44 


A07 


A2204 





1523 


7.89 ±0.21 


10.55 ±0.40 


8.39 ±0.81 


8.14 ± 0.39 


A07 



Table 1. X-ray properties of the VA sample clusters taken from V06 and A07 with the addition of the cluster masses (Mflt) determined 
with PIC ACS (Mobs a-re the hydrostatic from the referenced works). Where the uncertainties on the A07 properties were asymmetric, 
the mean value is used. T values have been rescaled to the [0.15 — 1]-R500 aperture as described in the text. 



of a variable slope in the jj-M relation and the explicit inclu- 
sion of T and jj. in the LM relation). 



3.1 Implementation 

In order to obtain constraints on our model parameters, 
the posterior distribution must be sampled over the large 
parameter space. There are many tools available for this 
generic problem, and the PICACS framework could be im- 
plemented in many ways. Here we note a few of the specifics 
of our implementation. PICACS was developed using the 



R statistical computing environment I ^R Development Core 
Team|[2bl2 1, and the posterior probability distribution was 
analysed using the Bayesian inference package Laplace 's De- 
mon within R (Hall 2012 Laplace 's Demon contains many 
Markov Chain Monte Carlo (MCMC) algorithms designed 
to efficiently sample the posterior probability distribution, 
and the "Adaptive Metropolis-within-Gibbs" algorithm was 
found to be effective at sampling the PICACS posterior dis- 
tribution and converging reliably. We refer the reader to the 



excellent documentation in [Hall (20121 for details of the al- 
gorithm, as well as an introduction to Bayesian inference. 

It is computationally advantageous to rewrite the PI- 
CACS scaling relations in log space (we used logio for con- 
venience). Thus each probability distribution in the likeli- 
hood functions is implemented as a Gaussian distribution in 
logio space. It is also essential to work with the logarithm of 
all probabilities, due to the small numerical values involved, 
and hence we sampled the natural logarithm of the poste- 
rior probability distribution, and all of the products in the 
likelihood and posterior expressions become sums. Unless 
otherwise stated, a flat prior was assumed for all param- 
eters. Three MCMC chains were run in parallel with ran- 
domised initial values and the fits were accepted when the 
three chains had converged (as determined by examination 
of the sample distributions). The probability distribution of 



each parameter was computed from the distribution of sam- 
ples from the chain after removing the start of each chain 
until the parameter values were stationary, and combining 
multiple chains. We summarise the posterior probability dis- 
tribution of each parameter as the mean, plus or minus the 
standard deviation of each distribution. 

In the following sections we apply the PICACS frame- 
work to observational samples to demonstrate its applica- 
bility and effectiveness in several scenarios. 



4 APPLICATION TO CLUSTERS WITH 
OBSERVED MASSES 

As a first test, we apply the PICACS framework to a set of 
clusters with precise mass estimates from X-ray hydrostatic 
analyses, and compare the performance to that of traditional 
BCES regression fits. For this study we used the samples of 
Vikhlinin et al.|(|2006| hereafter V06) and |Arnaud, Pointe-| 
couteau & Pratt (2007 hereafter A07). Both samples target 



relaxed, low-z clusters with high quality data over a reason- 
ably large range in mass, so are well suited to a first test of 
PICACS. Both samples have published Mobs, Tobs and fiobs 
for each cluster, but neither sample has published luminosi- 
ties available. We thus removed jCl and 0l from PICACS, 
leaving Or, 9^, Ct^ (the covariance between Tint and /iint ) , 
rTfi (the correlation between statistical scatter in T and jj.) 
and M as the parameters of interest. 

The two samples were combined, and duplicate clus- 
ters were removed from the A07 sample, and three clusters 
without measurements at i?5oo were excluded from the V06 
sample. In addition, Abell 907 was removed from the com- 



http : //cran . r-pro j ect . org/web/packages/LaplacesDemon 



bined list, as it appears in the REXCESS sample (Bohringer 
|et al.|2007[ ), for which we will be using the constraints from 
this study as independent priors in our subsequent analysis. 
This gave a combined list of 16 clusters at 0.02 < z < 0.23 
with a median z = 0.09, which we refer to as the "VA sam- 
ple". The properties of the sample used for this study are 
summarised in Table [T] 
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Figure 1. Best fitting PICACS fits to the TM (left) and (right) scaUng relations of the VA sample, with the shaded envelope 
indicating the la uncertainty. The A07 clusters have been scaled by the best-fitting Ft and i^p cross-calibration parameters. 



Method 


At 




5t 














PICACS 
BCES 


1.09 ±0.05 
1.11 ± 0.02 


0.60 ± 0.04 
0.61 ±0.03 


0.10 ±0.04 
0.01 ± 0.01 


0.97 ±0.07 


1.10 ±0.09 
1.13 ±0.04 


1.24 ± 0.08 
1.29 ±0.05 


0.17 ±0.09 
0.03 ± 0.03 


0.94 ± 0.12 


0.4 ±0.5 


PICACSt 


1.10 ± 0.02 


0.61 ± 0.02 


0.03 ±0.01 


1.01 ± 0.04 


1.17 ± 0.06 


1.25 ±0.03 


0.01 ± 0.01 


0.97 ± 0.07 





Table 2. Best fitting parameters of the TM and fiM relations fit to the VA sample using the PICACS and BCES methods. For 
convenience, the intrinsic scatter terms are given in natural log space, so are simply fractional values. For the full PICACS fit in the top 
part of the table, the intrinsic scatter values are computed from the diagonal elements of the covariance matrix. The final column PT^^, 
gives the Pearson's correlation for the parameters, derived from the covariance matrix. ^For comparison, we include in the bottom part 
of the table the results obtained when the covariance of statistical and intrinsic scatters is neglected. 



The temperatures from A07 were measured in the 
[0.15 — 0.75]i?5oo aperture so were scaled to the [0.15 — l]i?5oo 
aperture by multiplying them by 0.955, the midpoint of the 
range suggested by A07. Similarly the V06 temperatures 
were rescaled from the 70 kpc — 7?5oo aperture in which they 
were measured, to the [0.15 — l]i?5oo aperture by multiply- 
ing them by 0.97 as recommended in V06. The gas and total 
masses were all measured within JJsoo. 

As the two parent samples were observed with differ- 
ent satellites, Chandra (V06) and XMM-Newton (A07), we 
introduce additional cross-calibration factors, Ft and F^, 
such that the A07 Tobs and /iobs were multiplied by these 
factors respectively. We note that the choice to rescale the 
A07 rather than V06 properties was arbitrary, but as we 
shall see, these factors turn out to be negligible. The ability 
to include additional model components such as these scale 
factors is an advantage of the PICACS approach over BCES 
regression. 

The PICACS models were fit to the VA sample, and 
the resulting TM and /iM scaling relations are plotted in 
Figure [l] The fits were also performed with the standard 
orthogonal BCES method, and with PICACS assuming in- 
dependent scatter in the observables. The model parameters 
obtained with these techniques are summarised in Table [2] 
In these fits, the evolution parameters 7t and 7,1 were fixed 
at their self-similar values due to the small redshift range of 
the sample. 

The PICACS fits agree extremely well with the results 



from the conventional BCES fits, demonstrating that the 
new technique performs well. 



4.1 Covariance and Degeneracies 

Table[2]shows that the measurements of the intrinsic scatter 
in the TM and ^lM relations differ significantly between the 
PICACS and BCES methods. However, the definitions of in- 
trinsic scatter also differ, in that the BCES method does not 



measure scatter itself. Instead, we follow Maughan ( 2007 1 in 



defining the intrinsic scatter as the constant term that must 
to be added to each T (or ^) error bar in quadrature in 
logio space to produce a reduced of unity with respect 
to the BCES regression line. This is not self-consistent, as 
the best fitting BCES model is not the model which min- 
imises the in any direction. Furthermore, this approach 
treats the intrinsic scatter in each relation independently. 
When the intrinsic scatter terms are treated independently 
in PICACS, the agreement with the BCES scatter is good. 

The inclusion of scatter covariance in PICACS does not 
significantly change any model parameters other than the 
scatter terms, but does reduce the precision of the parame- 
ter estimates. This provides a more realistic measurement of 
our uncertainties in fight of possible correlated scatter. The 
covariance matrix Ct/j was used to calculate the Pearson's 
correlation between Tint and /iint, giving = 0.4 ± 0.5. 
Thus these data do not provide useful constraints on the 
correlation of the scatter in these relations. The constraints 
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Figure 2. Correlation matrix of tlie PICACS model parameters for the fit to tlie VA sample. The posterior densities are shown along 
the diagonal, with the pairs of points from the (thinned) samples of the MCMC chain shown on the lower triangle panels. The upper 
triangle panels show the Pearson's correlation coefHcient for the corresponding pair of parameters (with a text size proportional to the 
correlation strength). The <5 terms are in natural log space, and were computed from the square root of the diagonal elements of the 
covariance matrix Ct, and so do not represent the full information in the covariance matrix. 



on the covariance matrix for the intrinsic scatter in the VA 
sample are summarised in Table |3] The data were also not 
able to place useful constraints on the correlation of statisti- 
cal scatter in T and fi; the correlation coefficient was found 
to be = -0.3 ± 0.5 

It should be noted that the clusters in the VA sample 
were selected to be highly relaxed systems to permit reliable 
hydrostatic masses, so the scatter values measured here and 



their covariance may not represent the cluster population at 
large. 

In addition to the covariance in the intrinsic scatter, we 
can also investigate the degeneracies in the PICACS model 
parameters. The correlations between parameters are shown 
in the scatterplot matrix in Figure [2] This indicates that 
there is some degeneracy between the two normalisation 
terms and the two slope terms in the model, which is not 
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Tin 



Mint 



Tint (2.0 ± 1.8) X 10-^ 
Wnt (1.8 ± 3.0) X 10-^ 



(1.8 ±3.0) X 10-3 
(7.0 ±8.2) X 10-3 



Table 3. Covariance matrix Ct^ for the PICACS fit to the VA 
data. The covariance was measured in logj^g space. 



14.8 



14.0 




O 'A07 
A V06 



14.4 

log(M,i,/Msoi) 



Figure 3. X-ray hydrostatic Afo^s is plotted against A/gt from 
the combined PICACS model for the VA sample. The line shows 
equality between the two mass estimates. 

surprising given the dependency of both scahng relations on 
the cluster mass. Otherwise, no strong degeneracies exist in 
the model in this case. 



4.2 Mass constraints 

The best-fitting masses from the PfCACS analysis are given 
in Table [l] and are compared with Mobs in Figure [S] The 
agreement is excellent, which should not be surprising, given 
that Mfit is well-constrained by A/obs- It is interesting to 
note that the uncertainties on Mat are smaller than those 
on Mahs', the median uncertainty on Moha is 10%, while it 
is 8% on Mat- This modest increase in precision comes from 
the fact that we are requiring our best-fit masses to agree 
with both TM and scaling relations and the hydrostatic 
mass measurements. The precision is further increased (to a 
median of 6%) if independent intrinsic scatter is assumed. 



5 APPLICATION TO CLUSTERS WITHOUT 
OBSERVED MASSES 

We now apply the PICACS technique to the REXCESS 
sample, which is a representative set of low-redshift clus- 



ters with high-quality XMM-Newton data (Bohringer et al 



20071. The global X-ray properties (T, L, and /x) of the clus- 
ters were presented in [Pratt et al.| ( [2009| ) along with a study 
of the luminosity scaling relations of the sample. X-ray hy- 
drostatic masses are not currently available for the sample. 
The representative nature of the REXCESS sample, along 
with the limited redshift range and precisely measured X- 
ray properties make it a good choice for a second case study 



of the PICACS methodology. In the following, we use the 
T, L, and values of Pratt et al. ( 2009[ l, measured out to 
7?50o, with the central 15% of J?5oo excluded for T and L. 
As before, the evolution parameters 7t and 7l are fixed at 
their self-similar values. The properties of the clusters used 
in this study are summarised in Table [4] 

In the absence of Mohs for the REXCESS sample, pri- 
ors are needed on a subset of the PICACS scaling relation 
parameters to break the degeneracy between M^t and the 
scaling relation shape parameters. Initially, we will maximise 
the use of the information from the VA sample, and use 
the constraints on At, Bt, and from the PICACS 
analysis summarised in Table [2] (encoded as Gaussian pri- 
ors in linear space for the B terms and logjQ space for the 
A terms). We also use the posterior probability distribution 
on the covariance of T and /i from the VA sample (Table [3| 
as Gaussian priors on the first 2x2 elements of the full 3x3 
Ct/jL covariance matrix, and the VA constraint on rr^ as a 
Gaussian prior in linear space. We refer to this set of priors 
as the "VA priors". Later we will review the success of the 
PICACS method with weaker priors. 

Without A/obs, there is a very strong degeneracy be- 
tween /iint and Lint, as might be expected from equation 
( 20 1. This is discussed in the following section, but for our fit 
to the REXCESS data, Lint was assumed to be independent 
of Tint and /Xint- In other words, the Tint, Lint and /Xint,Lint 
off-diagonal terms of the Ct/jL covariance matrix were fixed 
at zero. This is our reference model for fitting data without 
ALobs, and we refer to it as the "reduced covariance" model. 

The best-fitting PICACS relations to the REXCESS 
sample are plotted in Figure |4j and the parameters are sum- 
marised in Table [5] These plots differ from those of conven- 
tional scaling relations, as the masses plotted are the best- 
fitting masses from the combination of scaling relations, and 
not observed masses. The constraints on the scaling rela- 
tion parameters are consistent with the VA priors, but with 
improved precision (particularly for 5^), and the new con- 
straints on the luminosity scaling parameters are quite pre- 
cise. Note that Bl is consistent with the self-similar value of 
1, which implies no mass scaling of the ICM structural pa- 
rameter. This is discussed in more detail later in the context 
of the LT relation. We will now investigate the degeneracies 
in the model and the sensitivity of the results to the choice 
of priors. 



5.1 Covariance and Degeneracies 

The intrinsic scatter covariance matrix Ct^l from our ref- 
erence fit to the REXCESS data is summarised in Table [S] 
The covariance of Tint and /.tint corresponds to a Pearson's 
correlation coefficient of pr/^ = — 0.05±0.38, consistent with 
no correlation, but with weak constraints. 

The correlations between the statistical scatter of the 
observables were not well constrained (ry^ = —0.2 ± 
0.4, rTL = 0.1±0.5,r^L = -0.1±0.5). We found that ignor- 
ing Dtiil (i.e. setting the r coefficients to zero) has a neg- 
ligible effect on the best fitting parameters or their errors; 
most values did not change at the precision given in Table m 
and the r coefficients had no degeneracies with other model 
parameters. 

In our fit to the REXCESS data, we assumed Qint to be 
independent of Tint and ^int- If we relax that assumption. 
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Name z T fj. L Mat 

keV 1O"M0 lO*"* org s"! 10"M, 



RXCJ00034 


-0203 





092 


3 


83 


± 





10 


1.99 ±0.04 


1.15 ±0.01 


2 


19 


± 





23 


RXCJ0006- 


3443 





115 


5 


24 







20 


4.48 ± 0.11 


3.17 ± 0.05 


4 


61 


± 





53 


RXCJ0020- 


2542 





141 


5 


54 


± 





13 


4.06 ±0.06 


4.05 ±0.03 


3 


69 


± 





38 


RXCJ0049- 


2931 





108 


2 


87 


± 





10 


1.66 ± 0.03 


0.99 ± 0.02 


1 


60 


± 





18 


RXCJ0145- 


5300 





117 


5 


81 


± 





15 


4.85 ±0.06 


3.87 ± 0.03 


4 


93 


± 





47 


RXCJ0211- 


4017 





101 


2 


08 







05 


0.98 ± 0.01 


0.48 ± 0.01 





96 


± 





13 


RXCJ0225- 


2928 





060 


2 


53 







14 


0.73 ±0.02 


0.31 ± 0.01 





88 


± 





13 


RXCJ0345- 


4112 





060 


2 


28 


± 





07 


0.82 ± 0.02 


0.37 ± 0.01 





90 


± 





13 


RXCJ0547- 


3152 





148 


6 


04 


± 





14 


5.94 ± 0.04 


5.73 ±0.04 


5 


46 


± 





50 


RXCJ0605- 


3518 





139 


4 


93 


± 





12 


4.63 ± 0.05 


4.23 ± 0.03 


4 


07 


± 





37 


RXCJ0616- 


4748 





116 


4 


17 


± 





11 


2.86 ±0.04 


1.88 ±0.02 


2 


94 


± 





29 


RXCJ0645- 


5413 





164 


7 


23 


± 





18 


10.08 ± 0.11 


11.33 ± 0.08 


8 


66 


± 





94 


RXCJ08214 


-0112 





082 


2 


81 


± 





10 


1.16 ±0.03 


0.54 ± 0.01 


1 


33 


± 





16 


RXCJ0958- 


1103 





167 


5 


95 







41 


4.43 ± 0.20 


5.21 ± 0.14 


3 


98 


± 





50 


RXCJ1044- 


0704 





134 


3 


58 







05 


3.32 ±0.04 


2.99 ±0.02 


2 


57 


± 





28 


RXCJ1141- 


1216 





119 


3 


58 


± 





06 


2.45 ± 0.02 


1.69 ± 0.01 


2 


31 


± 





23 


RXCJ1236- 


3354 





080 


2 


77 


± 





06 


1.21 ± 0.02 


0.61 ± 0.01 


1 


31 


± 





15 


RXCJ1302- 


0230 





085 


3 


48 


± 





08 


1.80 ± 0.02 


0.83 ± 0.01 


2 


16 


± 





26 


RXCJ1311- 


0120 





183 


8 


67 







12 


10.69 ±0.06 


14.93 ±0.07 


8 


96 


± 





86 


RXCJ15164 


-0005 





118 


4 


68 


± 





10 


3.61 ± 0.04 


2.76 ± 0.02 


3 


48 


± 





32 


RXCJ1516- 


0056 





120 


3 


70 


± 





09 


2.99 ± 0.04 


1.77 ± 0.02 


3 


04 


± 





35 


RXCJ2014- 


2430 





154 


5 


75 







10 


7.19 ± 0.07 


7.47 ± 0.06 


5 


94 


± 





55 


RXCJ2023- 


2056 





056 


2 


72 


± 





09 


1.03 ±0.02 


0.40 ±0.01 


1 


28 


± 





16 


RXCJ2048- 


1750 





147 


5 


06 


± 





11 


5.50 ±0.05 


4.40 ±0.03 


5 


15 


± 





58 


RXCJ2129- 


5048 





080 


3 


84 


± 





14 


2.23 ±0.04 


1.19 ±0.02 


2 


51 


± 





28 


RXCJ2149- 


3041 





118 


3 


48 







07 


2.48 ±0.03 


1.58 ±0.01 


2 


42 


± 





23 


RXCJ2157- 


0747 





058 


2 


79 







07 


1.12 ±0.02 


0.37 ± 0.01 


1 


52 


± 





22 


RXCJ2217- 


3543 





149 


4 


63 







09 


4.37 ± 0.04 


3.69 ±0.03 


3 


93 


± 





37 


RXCJ2218- 


3853 





141 


6 


18 


± 





20 


5.67 ± 0.07 


5.55 ±0.06 


5 


20 


± 





48 


RXCJ2234- 


3744 





151 


7 


32 


± 





12 


9.87 ± 0.10 


12.28 ±0.10 


8 


00 


± 





79 


RXCJ2319- 


7313 





098 


2 


56 


± 





07 


1.74 ± 0.03 


0.97 ± 0.01 


1 


62 


± 





18 



Table 4. X-ray properties of the REXCESS clusters taken from [Pratt et ak] l |2009[ l with the addition of the cluster masses determined 
from the PICACS fits with VA priors and the reduced covariance model. Where the uncertainties on the measured REXCESS properties 
were asymmetric, the mean value is used. Properties are determined within /Jsooi with the central O.I5R500 excluded for L and T. 




Figure 4. Best fitting TM (left), /jM (centre), and LM (right) scaling relations for the REXCESS sample, with the shaded envelope 
indicating the Itr uncertainty. The evolution parameters were fixed at their self-similar values, ■jt = 2/3, 7l = 2. Note that in these 
plots, the mass of each cluster is A/gt, the mass determined by the combined PICACS fit of the masses and scaling relations. 



and fit the full CthL covariance matrix, a strong correlation 
between Qint and Hint emerges, with p^Q = 0.96±0.03, which 
results in inflated (but strongly anti-correlated) values of 5^, 
and 5q. The reason for this is simply that without Mobs to 
help constrain /iint, Qint and fj,int are strongly degenerate 
in equation ( |20[ | . An advantage of the PICACS approach is 
that the expected physical covariance of L with T and fi are 
factored out, so we may look for covariance in the residual 



scatter in L. As we have seen, this is not possible with the 
current data, but with the addition of Afobs for the clusters, 
the degeneracies should be broken by virtue of the stronger 
constraints on /iint that would come from the /iM relation. 

Importantly, with the exception of 5^ and 5q, none of 
the scaling relation parameters in Table [5] are significantly 
affected by the inclusion (or not) of covariance in the intrin- 
sic or statistical scatter of the observables. The results are 
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Method 


At 




St 




i3„ 




Al 


Bl 






priors 


1.09 ±0.04 


0.60 ± 0.04 




1.10 ± 0.09 


1.24 ±0.08 












fit 


1.09 ±0.03 


0.57 ± 0.03 


0.09 ±0.02 


1.13 ± 0.06 


1.17 ±0.06 


0.13 ±0.04 


0.68 ±0.04 


1.01 ±0.06 


0.10 ±0.03 


0.09 



Table 5. Best fitting parameters of the PICACS scaling relations fit to the REXCESS sample using the reduced covariance model (i.e. 
only modelling covariance in the intrinsic scatter of T and /i, but including covariance in the statistical scatter of T, /i, and L). In the 
top section of the table, the full VA priors are used. In the middle section, the priors on the TM relation are removed, and in the bottom 
section, weaker self-similar priors are used on the slopes of the TM and /xM relations. All priors were encoded as Gaussian distributions 
with the specified mean and standard deviation, though the priors on the A terms, while reported in linear space, were implemented 
as Gaussians in logjQ space. For convenience, the intrinsic scatter terms are computed from the diagonal elements of of the covariance 
matrix in natural log space, so represent fractional scatter. The final column, 5-jv/> gives the median fractional error on Mflt. Evolution 
terms were fixed at the self-similar values. 



Tint 


Atint 


Qint 


Tint (1.8 ± 0.8) X 10-^ 


(0.1 ± 1.0) X 10-^ 





Afint (0.1 ± 1.0) X 10-^ 


(3.6 ±2.0) X 10-3 





Qint 





(2.0 ± 1.2) X 10-3 



Table 6. Covariance matrix Ct^l fof the PICACS fit to the REXCESS data. The off-diagonal Q terms were fixed at zero. The 
covariance was measured in logjg space. 



thus robust against the details of this covariance modelling, 
though precision on all terms, including the mass measure- 
ments is improved when using the reduced covariance model. 

In Figure [5] the correlations of the model parameters for 
the reference fit to the REXCESS data are plotted. Unsur- 
prisingly, strong degeneracies exist between the normalisa- 
tions and between the slopes of the scaling relations. This 
is due to the mass of each cluster being a free parameter 
in each scaling relation. We also see that even when Qint 
is independent from the other scatter terms, a degeneracy 
is present between the magnitudes of the scatter, 5^ and 
5q. As above, this is due to the strong dependency of the 
observed luminosity on the baryon fraction, which is only 
partially broken by the VA prior on the T and ^ terms of 
the Cti^l covariance matrix. Without additional informa- 
tion (e.g. from Mobs) it is not possible to constrain 5^ and 
5q independently. 

Interestingly, Table |5] suggests that the magnitudes of 
the three intrinsic scatter terms in the PICACS model are 
approximately equal for the REXCESS clusters. However, 
the strength of this conclusion is also limited by our current 
inability to break the degeneracy between 5^ and 5q. Recall 
that this does not imply that the intrinsic scatter in L is 
similar in size to that in T or /x, as the overall intrinsic 
scatter in L includes contributions from all three terms. 

We investigated the sensitivity of the PICACS approach 
to the number of priors by removing the VA priors on the 
TM relation, but keeping those on the /iM relation, and on 
the T, n components of CxfiL ■ In this case, the degeneracies 
between the slope parameters seen in Figure|5]were stronger, 
and the fit did not become formally stationary, with the 
slope parameters moving coherently around on these lines of 
degeneracy. However, various samples taken from the chain 
showed that all parameters remained within ~ la of their 
values when the full VA priors were used. We thus recom- 
mend that priors on two of the three relations are used for 
analyses where Mobs are not available for at least some of 
the clusters. 



6 PICACS MASS ESTIMATES 

A useful application of PICACS is the estimation of masses 
for clusters without Mobs- The best fitting masses are auto- 
matically estimated as part of the Bayesian inference pro- 
cess, and provide masses that are fully consistent with the 
observed properties and derived scaling relations. In !|4] we 
saw that PICACS made a modest improvement to the pre- 
cision of the hydrostatic mass estimates for the VA sample. 
In this section we evaluate the performance of PICACS at 
constraining the unknown masses of the REXCESS sample. 
The best-fitting PICACS masses from our reduced covari- 
ance fit are given in Table |4j and the median precision is 
9%. 

The conventional way to estimate X-ray masses for a 
sample of clusters such as this, in the absence of hydro- 
static masses, is to use a single scaling relation to estimate 
the mass from a single observable (e.g. T, p), simple com- 
binations of observables such as Yx (defined as the product 
of T and fi; Kravtsov, VikhUnin & Nagai 2006 1 or more 
generalised combinations of observables ( Ettori et al.|2012 1 . 
Typically, when doing this, only the statistical errors on the 
observable are propagated to the mass estimate (Mflt), or at 
best, the uncertainties on the shape parameters of the scal- 
ing relation are also propagated. Generally, the contribution 
from the intrinsic scatter in the relation is ignored, but this 
may be a significant contributor when the statistical errors 
on the observable are small (e.g. for the REXCESS sample, 
the median statistical error on /i is 1%). It is straightforward, 
using a Bayesian approach to include the intrinsic scatter 
and all of the uncertainties on the final mass estimate. 

Let us define a generic scaling relation between mass 
and some observable (or combination of observables) X. Us- 
ing our previous notation, we have 



X 
Xo 



AxE{zy 



Mo) 



(26) 
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Figure 5. Correlation matrix of tiie PICACS model parameters for the fit to the REXCESS sample with the reduced coavariance model 
(i.e. covariance between Tjut and Hint, but not Qint)- The posterior densities are shown along the diagonal, with the pairs of points from 
the (thinned) samples of the MCMC chain shown on the lower triangle panels. The upper triangle panels show the magnitude of the 
Pearson's correlation coefficient for the corresponding pair of parameters (with a text size proportional to the correlation strength). The 
5 terms are in natural log space, and were computed from the square root of the diagonal elements of the covariance matrix Ct^l ^^nd 
so do not represent the full information in the covariance matrix. 



and our likelihood function is 

Cx = P(XohB\Xint,o-x)P{Xint\Xmod,5x) (27) 
= P{X,i,,\Xint,(Jx)P{Xint\M,ex,5x) (28) 

The posterior probability distribution of the model param- 
eters is then 

P(ex, Af,Xint|Xobs) oc CxP{ex)P{M)P(Xint) (29) 



The priors on 6x (denoting Ax , Bx , Jx ) and 5x are taken 
from the scaling relation to be applied. In most cases, yx 
will be fixed (i.e. at a self-similar value) and 5x may not 
have measurement errors, though neither of these factors are 
limitations of the Bayesian approach. In equation ( |29| l, we 
do not expect significant additional constraints to be placed 
on P{ex\Xohs) or P{Xint\Xohs), but P{M) wiU be jointly 
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Figure 6. Posterior probability contours for and for the 
reduced covariance model with VA priors. Contours are set at the 
Icr, 2a and So- levels and the dotted lines shows the self-similar 
values of = 1, = 1. 



constrained by the priors on those terms and by Xobs, and 
fully marginalised over all of the uncertainties. 

We apply this method to compare the precision of the 
PICACS mass estimates for the REXCESS sample with 
those obtained from a single scaling relation. We use the 
YxM relation of A07, for which the intrinsic scatter was 
given as 0.039 in logio space, with no uncertainties provided. 
We convert this to the intrinsic scatter in Yx by dividing by 
the slope of the A07 YxM relation, and transform to nat- 
ural log space to give 5yx ~ 0.16. Including this intrinsic 
scatter, and the errors on Ayx , and Yx , we find a me- 
dian uncertainty on the REXCESS masses of 10%. Neglect- 
ing the intrinsic scatter results in a median mass precision 
of 4%, while including an uncertainty on 5yx of the form 
Syx = 0.16 ± 0.05 (a reasonable estimate based on our fits 
to the VA data) slightly increases the median uncertainty to 
11%. Recall that the median PICACS mass uncertainty for 
the same clusters was 9%. We thus find that in the absence 
of Mobs, PICACS provides mass estimates of equivalent or 
marginally improved precision compared to a single scaling 
relation, and has the advantage of providing masses that are 
simultaneously consistent with all of the observables. 



7 THE PICACS LT RELATION. 

We will now examine the LT relation predicted by the PI- 
CACS fit to the REXCESS data. Recall that we do not fit 
the data directly in the LT plane, but the form of the LT 
relation is given by the self-consistent PICACS models (see 
equations ([13} , ([n|, ([l5|). In Figure [t] we plot the REX- 
CESS data in the LT plane, along with the PICACS LT 
relation. Also plotted is the BCES orthogonal regression fit 
to the data in the LT plane. The REXCESS luminosities 
were scaled by PICACS evolution parameter 7lt for the 
plot, and this is included in the BCES fit (it is automati- 



^-44.5 

I 



7^ 



0.6 

log(T/keV) 



Figure 7. The LT relation predicted by the PICACS fit to the 
REXCESS mass scaling relations is plotted with the observed 
REXCESS properties. The solid line shows the PICACS model, 
which is not fit directly to the data in this LT plane, with the 
grey envelope giving the Icr uncertainty. The dashed line shows 
the BCES orthogonal regression as fit to the data in this plot. 
Luminosities are scaled by the best fitting PICACS -fLT evolution 
parameter (see text for details), and L and T were measured in 
the [0.15 — l]-R5oo aperture. 



Method 


Alt 


Blt 


7LT 


PICACS 


0.70 ±0.05 


2.91 ±0.16 


0.39 ±0.11 


BCES 


0.72 ±0.04 


3.05 ±0.17 


0.39 



Table 7. Best fitting parameters of the LT relation of the REX- 
CESS sample predicted by the PICACS fit and determined from 
a BCES orthogonal regression to the LT data. The evolution 
parameter 7£t does not include any redshift-dependence of the 
cluster properties, but gives the self-similar evolution including 
the dependency on the slopes of the mass observable scaling rela- 
tions in equation | |15| l. This value was fixed for the BCES fit. See 
the text for a full discussion of the evolution parameter. 



cally part of the PICACS fit). Note that in the current fit, 
7lt does not include any redshift-dependence of the ob- 
served cluster properties, but gives the expected self-similar 
evolution of the LT relation when the dependency on the 
slopes of the mass observable scaling relations in equation 
( 15 1 is included. The agreement between the PICACS LT 
relation and the BCES fit is excellent; the parameters of the 
LT models are summarised in Table [T] 

In fact, while it is reassuring that the PICACS method 
is able to reproduce the observed LT relation, this should 
not surprise us; it simply demonstrates that the three PI- 
CACS scaling relations form an internally consistent descrip- 
tion of the observed properties. Note that the good agree- 
ment with the observed LT relation does not necessarily 
indicate that the individual scaling relations are a good de- 
scription of the clusters. For instance, if no prior is included 
on the slope of any of the scaling relations, the degeneracy 
of the slopes leads to unphysical values for Mgt and the 
slope parameters. However the internally-consistent scaling 
relations means that the combination of A and B param- 
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eters remains such that the observed LT relations is still 
reproduced reasonably well. In other words, in the PICACS 
framework, the observed form of the LT relation is a neces- 
sary consequence of requiring the observables to be related 
to the same masses through power law relations, but is not 
sensitive to the form of those relations. 



7.1 The slope of the LT relation 

The advantage of the PICACS method is that while the 
BCES fit simply tells us that the slope of the LT relation 
is steeper than the self-similar expectation of Blt = 2, PI- 
CACS enables us to decompose this into the separate mass 
scaling relations. Table [5] shows that Bl is consistent with 
unity. Recall that this parameter describes the additional 
steepening of the luminosity mass relation, beyond that due 
to the mass dependency of T and fj,. The PICACS results 
thus show that (given the VA priors) the steep slope of the 
REXCESS LT relation is consistent with the departures 
from self similarity in T and fi alone, with no additional 
contribution from L. This indicates that the ICM structure 
parameter Q has no mass dependency. A similar conclusion 
on the lack of mass dependency of the ICM structure param- 



eter Q was reached by Pratt et al. (20091, who estimated Q 



directly from the REXCESS gas density profiles. 



In equation ( 14 1 , we see the dependency of the LT slope 



on the slopes of the mass scaling relations. For the best fit- 
ting values for the REXCESS data, we find that the shal- 
lower than self-similar slope of the TM relation, and the 
steeper than self-similar slope of the relation both con- 
tribute to the steepening of the LT relation, with the effect 
of each in the ratio of roughly 1 : 2. A physical interpreta- 
tion of these results is that extra heating of gas in low mass 
clusters and depletion of gas within -R500 in low mass clus- 
ters are both important (and perhaps physically connected) 
processes, while, the restructuring of the ICM within iisoo 
has no significant contribution to the steepening of the LT 
relation. This conclusion should, however, be treated with a 
little caution. Figure [5] shows the confidence contours from 
the PICACS fit in the B^,Bl plane; taking the parameter 
degeneracy into account, the data can only exclude > 1 
at the ~ 2a level. Furthermore, as we have seen, the break- 
ing of the degeneracy between _B,, and -Bl is sensitive to the 
choice of prior. 

Our measurements of the steeper fj,M and shallower 
TAl relation compared to self-similar expectations are not 
unique - for example, A07 found Bt = 0.58 ± 0.03 and 
B^ = 1.25 ± 0.06. The power of the PICACS approach al- 
lows us to put these pieces together, and bring in additional 
information about the ICM structure through the LM rela- 
tion. 



7.2 The Evolution of the LT relation 

The standard approach to testing evolution of the LT rela- 
tion is to use equation (1121) and define 7/,^ = 1 as the ref- 



erence point for self-similar evolution (e.g. Maughan et al 
2006). However, this only holds true if the slopes of all of 



of the LT relation is not 7lt = 1 but is given by equation 
(15 1. Of course, this does not imply that the slopes of the 
scaling relations influence the evolution of clusters, it is sim- 
ply a consequence of algebraic manipulations used to derive 
the LT relation. 

If applied to a sample covering a significant redshift 
baseline, PICACS can be used to fit the evolution of all scal- 
ing relations self-consistently with their slopes, providing a 
true measurement of their evolution. We reserve this investi- 
gation for a future paper, as measuring the evolution of the 
scaling relations also requires modelling of sample selection 
functions to avoid biases masking or mimicking real evolu- 
tion. This is quite possible within the PICACS framework. 



along the lines laid out by Mantz et al. (2010b I. 



For the current study, we simply note that the PICACS 
fit to the REXCESS data give the self-similar evolution of 
the LT relation as "/lt = 0.39 ± 0.11, significantly weaker 
than the naive expectation of 7lt = 1. This should be taken 
into account when establishing a reference self-similar evo- 
lution against which to measure deviations. For example, 
using this PICACS reference for the self-similar LT evolu- 
tion reduces the significance of the weaker than self-similar 



the mass scaling relations of T, fj, and L are self-similar. In 
the event that they are not, which is suggested by many ob- 
servational studies, then the expected self-similar evolution 



(or negative) evolution measured by recent studies ( Reichert 
et al.|2011[ [Hilton et al.|2012| ). Those results remain statis- 
tically significant compared to our weaker self-similar refer- 
ence, but the most robust measurements of the evolution will 
come from a full PICACS analysis of the cluster population 
to high redshift. 



8 CAUTIONS AND CAVEATS 

As is clear from Figure [5] strong degeneracies exist in the 
PICACS model when there are not direct observational con- 
straints on the cluster masses. Not shown in the correlation 
matrix are the degeneracies between the scaling relation pa- 
rameters and the fitted cluster masses. These are mitigated 
with the use of priors on the scaling relations, but without 
any informative priors, the degeneracy is total; it would be 
quite possible for masses to fit to unphysical values, and for 
the normalisations and slopes and scatters of the relations 
to adjust to compensate. Without Mobs for at least a subset 
of clusters, the PICACS fits are highly dependent on the 
choice of priors for a subset of the scaling relation shape 
parameters. 

In the case of our analysis of the REXCESS sample, 
the VA priors were derived from a sample of relaxed clus- 
ters, while the REXCESS clusters encompass a representa- 
tive range of dynamical states. This difference is unavoid- 
able if X-ray hydrostatic masses are to be used as Mobs, but 
could give rise to systematic effects in the derived REXCESS 
scaling relations and e.g. our conclusions on the relative con- 
tributions of the mass scaling relations to the steepening of 
the LT relation. The most robust PICACS analysis of rep- 
resentative samples like REXCESS would require mass con- 
straints from a techniques such as gravitational lensing or 
caustic analyses which are insensitive to cluster dynamical 
state. 

In its current form, PICACS does not include several 
factors which could affect the measured scaling relations 
and masses. The most significant of these is the modelling 



of Malmquist and Eddington biases (see e.g. Allen, Evrard 
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|fc Mantzl|2011| for a discussion in the context of scaling re- 
lations). These biases can affect both the shape and evolu- 
tion of the mass scaling relations in X-ray selected samples. 
The principal effect is a bias towards clusters with higher 
than average luminosity for a given mass. Full treatment of 
these effects require knowledge of the survey selection func- 
tion and the mass function describing the population from 
which the clusters were sampled. |Mantz et al. ( 2010b|a" l have 
demonstrated how to include this in a self-consistent analysis 
of cosmological parameters and scaling relations. Extending 
PICACS along those lines will enable us to remove any ef- 
fects of bias in luminosity in the current analysis, while the 
modelling of covariance between the scatter terms provides 
a natural way to propagate the effects of the bias through to 
the other scaling relations. For the current study, the REX- 
CESS selection function is known, but the VA sample has 
no selection function (due to the cherry-picking of relaxed 
clusters for hydrostatic masses). This means that a bias cor- 
rection could only be approximate as the PICACS analysis 
of the REXCESS clusters depends strongly on the VA priors. 
We also currently do not model the uncertainty on the 



measurement errors (as in Andreon & Ifurn 20101. This is 
not expected to have a large effect on the current results due 
to the relatively high precision on the observables, but could 
be important when modelling data with larger measurement 
errors, and could plausibly affect the determination of the 
magnitude and covariance of the intrinsic scatter. 



9 SUMMARY AND CONCLUSIONS 

We have introduced PICACS, an internally consistent phys- 
ical model for the analysis of galaxy cluster mass scaling 
relations, and a Bayesian framework with which to imple- 
ment it. PICACS provides a self-consistent set of constraints 
on the parameters describing the shape, scatter and evolu- 
tion of the scaling relations and on the masses of individual 
clusters. It may be used to study the scaling properties of 
clusters with observed masses, estimate the masses of clus- 
ters without observed masses, or a combination of the two. 
The new method was demonstrated on several observational 
datasets, and the key results were as follows: 

• A PICACS analysis of the VA sample of relaxed clusters 
with precise X-ray hydrostatic masses was used to measure 
the shape and scatter of the TM and scaling relations, 
producing result in excellent agreement with traditional re- 
gression methods. 

• Our analysis of the REXCESS sample of clusters, which 
lacks hydrostatic mass estimates, utilised priors from the VA 
analysis and was able to jointly constrain the scaling rela- 
tions of T, ^ and L with mass, and provide mass estimates 
for the clusters. 

• The analysis included modelling of the covariance be- 
tween intrinsic scatter and statistical scatter of the observ- 
ables, and the data suggested a positive correlation in the in- 
trinsic scatter of T and ^, but the evidence was weak. None 
of the other covariance terms were well constrained, and 
with the exception of the intrinsic scatter measurements, 
the model parameters were insensitive to the inclusion of 
covariance modelling. 

• For clusters with hydrostatic mass estimates, PICACS 
was found to make a modest improvement to the precision of 



the mass estimates. For clusters without hydrostatic masses, 
the precision of the PICACS mass estimates was equivalent 
to, or slightly improved compared with that obtained with 
a single scaling relation. 

• The PICACS analysis of the REXCESS sample showed 
that the steep observed slope of the LT relation is consistent 
with roughly equal contributions from heating and depletion 
of the ICM from within -R500, and not due to structural 
variations of the ICM inside -R500. 

• The PICACS framework fully accounts for the effect of 
the scaling relation slopes on the expected self-similar evolu- 
tion of the LT relation, and we show that the expected evo- 
lution is significantly weaker than is usually assumed when 
this effect is ignored. 

• The PICACS framework does not provide something for 
nothing - strong degeneracies exist within PICACS which 
must be broken with informative priors on the forms of two 
of the three mass scaling relations, or with mass estimates 
for some of the individual clusters. 

In common with the self-consistent modelling of the 
scaling relations and cluster mass function of |Mantz et al.| 
( |2010b|a| l, PICACS represents a new way of thinking about 
the galaxy cluster scaling relations. The PICACS technique 
has many potential applications: 

• It can be used to give robust measurement of the evolu- 
tion of cluster scaling relations. This requires the extension 
of PICACS to incorporate selection functions, which will be 
the subject of a forthcoming paper. 

• The PICACS scaling relations for the REXCESS data 
provide a self-consistent description of that representative 
cluster population. This will allow for useful comparisons 
with simulated cluster populations - in order to provide a 
good description of real clusters, the simulated populations 
should match all three of the PICACS scaling relations. 

• The PICACS framework is trivially extendable to in- 
corporate additional observational data for clusters. Essen- 
tially any cluster observable that is expected to correlate 
with cluster mass (e.g. gravitational lensing mass estimates, 
Sunyaev-Zel'dovich effect signals, galaxy richness and dy- 
namics) can be added to the framework. This will provide 
a natural way to test the self-consistency of the different 
cluster mass estimators, as well as maximise the precision 
of the mass constraints by combining all of the available 
information. 
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